Abstract

An ambiguity in the differential calculus of several variables is exhibited and,
with the help of the language of category theory, a way to resolve it in its most
general form is offered. It is also shown that the new definition is related to
other well-known definitions in the literature.
$E[x_1(t),\dots,x_{n-1}(t),t] \stackrel{\mathrm{def}}{=} E[\mathbf{r}(t),t], \qquad E(x_1,\dots,x_{n-1},t) \stackrel{\mathrm{def}}{=} E(\mathbf{r},t)$

is usually not remarked in the literature, and for this reason one can often write down
meaningless symbols like:

$\frac{\partial}{\partial t}E[\mathbf{r}(t),t]$   (1)

$\frac{d}{dt}E(\mathbf{r},t).$   (2)
Ambiguities in the "notation" for partial differentiation have been remarked by Arnold
[1], p. 226 (p. 258 in the English translation), without further development. The symbols (1),
(2) are meaningless, because the process denoted by the operator of partial differentiation
can be applied only to functions of several independent variables, and $E[\mathbf{r}(t),t]$ is not such
a function. Meanwhile, the operator of total differentiation with respect to a given variable
can be formally applied to functions of one variable only. However, we have a well-known
formula relating both concepts:

$\frac{d}{dt}E = (\mathbf{V}\cdot\nabla)E + \frac{\partial}{\partial t}E$   (3)

(here $\mathbf{V} = d\mathbf{r}/dt$).

Let us show that, in this form, Eq. (3) cannot be correct. What is the correct
argument for the symbol E on both sides? If we say that the correct argument for both
sides is $[\mathbf{r}(t),t]$, we get the chain of symbols (1); but in this case the operator of partial
differentiation would indicate that we must construct a new function of the form $(\partial E/\partial t)$,
hence we use the following procedure:

$\frac{\partial E}{\partial t} = \lim_{\Delta t\to 0}\frac{E[\mathbf{r}(t+\Delta t),\,t+\Delta t]-E[\mathbf{r}(t),t]}{\Delta t}$   (4)

But this is the definition of total differentiation! Thus, the symbols of total and of partial 
differentiation denote the same process, therefore, because E is the same function on both 
sides of the equation, we get: 

$(\mathbf{V}\cdot\nabla)E[\mathbf{r}(t),t]=0$   (5)

always. But even if the procedure which we followed were correct (which it is not, of
course!), this equation is not correct for E as a function of the functions r(t), because the
partial differentiation would involve increments of the functions r(t) of the form
$\mathbf{r}(t)+\Delta\mathbf{r}(t)$, and we do not know how we must interpret this increment, because we have two
options: either $\Delta\mathbf{r}(t)=\mathbf{r}(t)-\mathbf{r}^*(t)$, or $\Delta\mathbf{r}(t)=\mathbf{r}(t)-\mathbf{r}(t^*)$. These are different processes,
because the first one involves changes in the functional form of the functions r(t), while the
second involves changes in the position along the path defined by r = r(t) while preserving
the same functional form. Hence, it is clear that we have here different concepts. If we
remember the definition of partial differentiation, we can see where the mistake is: "the
symbol $\frac{\partial}{\partial t}E(\mathbf{r},t)$ means that we take the variations of t when the values of r are constant".
It means that the only change we make in the function is $t+\Delta t$. But this is only possible
if the coordinates r are independent of t. Hence, we can see that the correct argument
cannot be $[\mathbf{r}(t),t]$, because, as we have shown, this supposition leads to the incorrect result
(5). If we make the other supposition, that the correct argument is $(\mathbf{r},t)$, we reach the
same conclusion, i.e., equation (5). Hence, neither of these suppositions is correct. What is
the solution, then? Actually, in equation (3) we have two different functions: on the
left hand side we have the function $E[\mathbf{r}(t),t]$ defined on a curve in an n-surface, and on the
right hand side we have the function $E(\mathbf{r},t)$ defined on all the n-surface. These are obviously
quite different functions, while we have a limiting procedure to get a unification of
concepts in the realm of functions of one variable.
Now let us introduce the following notation: 

$f = E\circ p,$   (6)

where the symbol "$\circ$" means a composition of functions and where

$E : R^{n}\to R, \qquad p : R\to R^{n}, \qquad f : R\to R.$

It is clear that $p = p(t) = \{x_1(t),\dots,x_{n-1}(t),t\} = \{\mathbf{r}(t),t\}$ is a curve which lies on the
n-surface where the function E is defined.
Hence we can write down the equation:

$\frac{df}{dt} = \lim_{x_i\to x_i(t)}\left[(\mathbf{V}\cdot\nabla)E+\frac{\partial E}{\partial t}\right]$

which shows our point more clearly: the functions on the two sides (f and E) are different
functions. Of course, we suppose that the components of the vector V tend to the derivatives
$dx_i/dt$ in the limit. But here is where our grammatical distinction appears: the right hand
side is evaluated at all points along the curve p(t), that is:

$\left.(\mathbf{V}\cdot\nabla)E\right|_{x_i=x_i(t)}+\left.\frac{\partial E}{\partial t}\right|_{x_i=x_i(t)}$

Let us explain the distinction as follows: the operator of total differentiation is
just a differentiation of a function which can depend on one independent variable, and the
operator of partial differentiation is just a partial differentiation of a function which
can depend on several independent variables. An obvious question immediately arises:
what is the relation between these domains? Obviously, the function of one variable is
one entity and the function of several variables is a different one. The relation lies in the
evaluation of the function obtained by partial differentiation at points along the curve
p(t). Or, in more general terms, the following condition must hold: for
all $\varepsilon > 0$ there is a $\delta[\varepsilon,p(t)] > 0$ such that if we take a point in the ball

$|\bar{\mathbf{r}}-p(t)|<\delta[\varepsilon,p(t)],$

where $\bar{\mathbf{r}} = \{x_1,\dots,x_{n-1},t\} = \{\mathbf{r},t\}$, then

$\left|\,\{(\mathbf{V}\cdot\nabla)E\}(\mathbf{r},t)+\left[\frac{\partial E}{\partial t}\right](\mathbf{r},t)-\left(\{(\mathbf{V}\cdot\nabla)E\}[p(t)]+\left[\frac{\partial E}{\partial t}\right][p(t)]\right)\right|<\varepsilon,$

where, of course, $E = E(\bar{\mathbf{r}}) = E(\mathbf{r},t)$.

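We have not supposed, of course, that the continuity is uniform. The abbreviated
form of this condition is:

$\frac{df}{dt}(t) = \lim_{\bar{\mathbf{r}}\to p(t)}\left\{(\mathbf{V}\cdot\nabla)E(\mathbf{r},t)+\frac{\partial}{\partial t}E(\mathbf{r},t)\right\}$   (7)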



The distinction between¹ $E(\mathbf{r},t)$ and $E[\mathbf{r}(t),t]$ is important in some physical contexts,
as is shown in [2] (see, especially, Eq. (28)). The grammatical distinction is that the
realm of functions of one independent variable is not the same as the realm of functions
of several independent variables, and that the relation between these two realms is given
by a limiting procedure.
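The distinction can be checked numerically on a toy case. The following is only a minimal sketch under assumptions that are not in the text: the field E(x, t) = x t, the curve x(t) = t^2 and all function names are hypothetical choices made for illustration. It compares the derivative of the one-variable function f = E o p with the partial derivatives of the two-variable function E evaluated on the curve.

```python
# Hypothetical example: E and the curve are illustrative choices, not taken from the paper.

def E(x, t):
    """A function of two independent variables (x, t)."""
    return x * t

def x_of_t(t):
    """A curve x = x(t); the point p(t) is (x(t), t)."""
    return t * t

def f(t):
    """The composition f = E o p, a function of the single variable t."""
    return E(x_of_t(t), t)

def central_diff(g, s, h=1e-5):
    """Symmetric finite-difference approximation of g'(s)."""
    return (g(s + h) - g(s - h)) / (2.0 * h)

def partial_t_of_E(x, t):
    """Partial derivative of E in t, with x held fixed (x independent of t)."""
    return central_diff(lambda s: E(x, s), t)

def partial_x_of_E(x, t):
    """Partial derivative of E in x, with t held fixed."""
    return central_diff(lambda y: E(y, t), x)

def rhs_on_curve(t):
    """(V . grad)E + dE/dt, computed for E(x, t) and then evaluated at p(t)."""
    x = x_of_t(t)
    v = central_diff(x_of_t, t)  # V = dx/dt
    return v * partial_x_of_E(x, t) + partial_t_of_E(x, t)

t0 = 1.5
lhs = central_diff(f, t0)                       # derivative of f = E o p
rhs = rhs_on_curve(t0)                          # right hand side of Eq. (3) on the curve
partial_only = partial_t_of_E(x_of_t(t0), t0)   # dE/dt alone, evaluated on the curve
print(lhs, rhs, partial_only)
```

In this sketch `lhs` and `rhs` agree, while `partial_only` differs from both: the partial derivative is taken on the two-variable function and only afterwards evaluated on the curve.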

II. SOME REMARKS RELATED TO THE FUNCTIONAL EQUATION (6) 

What conditions must the relation $f = E\circ p$ satisfy to make sense? It is obvious
that all its elements f, E and p have to exist, and we must, in fact, write down the more
general relation:

$f(t) = \lim_{\bar{\mathbf{r}}\to p(t)}E(\mathbf{r},t).$   (8)

It means that the function E must be continuous at all points of the curve p.

¹ Or, what is the same, between $E(\bar{\mathbf{r}})$ and $E[p(t)]$.
We have to consider seven cases²:

1. The two functional forms E and p are known: this is the {E, p}-case;

2. E and f are known: {E, f}-case;

3. p and f are known: {p, f}-case;

4. Only E is known: {E}-case;

5. Only f is known: {f}-case;

6. Only p is known: {p}-case;

7. All functions are unknown: {}-case.

In the {E, p}-, {E, f}- and {p, f}-cases we can define one of the functions in terms
of the other two; for example, in the {E, f}-case we define p, etc. In the {E}-, {f}-
and {p}-cases one can show that it is possible to define the other functions under certain
conditions. Let us make a brief review of these cases.

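{E}-case: In this case we only know the form of E, and we need to define
the forms of the other two functions. We suppose that $E\in C^{1}(R^{n},R)$,
$p\in C^{1}(R,R^{n})$, $f\in C^{1}(R,R)$. Now we write down our defining equation in
the form

$\frac{df}{dt}=\lim_{\bar{\mathbf{r}}\to p}\left\{\sum_{i=1}^{n-1}V_i\frac{\partial E}{\partial x_i}+\frac{\partial E}{\partial t}\right\}$   (9)

and we propose the following two equations:

(a) $\frac{df}{dt}=\lim_{\bar{\mathbf{r}}\to p}\frac{\partial E}{\partial t}$ and (b) $V_i(\bar{\mathbf{r}})=\sum_{j=1}^{n-1}b_{ij}\frac{\partial E}{\partial x_j}$,   (10)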

where $b_{ij}$ is a skew-symmetric matrix ($b_{ij}=-b_{ji}$). This proposal has the
following motivation: we define the components of the vector field V by (10b);
then, when we put this equation into (9), the first term on the right hand side
vanishes, because $\sum_{i,j}b_{ij}\frac{\partial E}{\partial x_i}\frac{\partial E}{\partial x_j}=0$ for a skew-symmetric $b_{ij}$, and we get
the equation (10a). This is not yet enough. We construct
the curve as an integral curve of the vector field with the components (10b),
i.e., as the solution of the following set of equations (a non-autonomous system
of differential equations):

$\frac{dx_i}{dt}=\sum_{j=1}^{n-1}b_{ij}\frac{\partial E}{\partial x_j}$   (11)

Now, with the solution of equation (11), we have an explicit form of the
curve p. And we know E, hence we know its partial derivatives. Then for the
function f we can write down:

$f=\int\left(\lim_{\bar{\mathbf{r}}\to p}\frac{\partial E}{\partial t}\right)dt+\mathrm{const}.$   (12)

So, with just the form of E we can define the form of the other two functions.

² When we know all three functions, we must only check that the relation (7) is valid. This is
trivial.
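The construction of the {E}-case can be illustrated numerically. The following is only a sketch under assumptions that do not come from the text: the function E(x1, x2, t) = x1^2 + x2^2 + sin t, the skew-symmetric matrix b = [[0, 1], [-1, 0]], the starting point and all names are hypothetical. The curve is obtained by integrating a system of the form (11) with a standard Runge-Kutta step, and f is recovered from an integral of the form (12).

```python
import math

# Hypothetical data: E, b and the initial point are illustrative choices only.

def E(x1, x2, t):
    return x1 * x1 + x2 * x2 + math.sin(t)

def grad_E(x1, x2, t):
    """Partial derivatives of E with respect to x1 and x2."""
    return (2.0 * x1, 2.0 * x2)

def dE_dt(x1, x2, t):
    """Partial derivative of E with respect to t."""
    return math.cos(t)

def V(x1, x2, t):
    """V_i = sum_j b_ij dE/dx_j with the skew-symmetric b = [[0, 1], [-1, 0]]."""
    g1, g2 = grad_E(x1, x2, t)
    return (g2, -g1)

def rk4_step(x1, x2, t, h):
    """One classical Runge-Kutta step for dx_i/dt = V_i."""
    k1 = V(x1, x2, t)
    k2 = V(x1 + 0.5 * h * k1[0], x2 + 0.5 * h * k1[1], t + 0.5 * h)
    k3 = V(x1 + 0.5 * h * k2[0], x2 + 0.5 * h * k2[1], t + 0.5 * h)
    k4 = V(x1 + h * k3[0], x2 + h * k3[1], t + h)
    nx1 = x1 + h * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
    nx2 = x2 + h * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
    return nx1, nx2

def follow_curve(x1, x2, t_end, steps=2000):
    """Integrate the curve from t = 0 and accumulate the integral of dE/dt along it."""
    h = t_end / steps
    t = 0.0
    integral = 0.0
    for _ in range(steps):
        left = dE_dt(x1, x2, t)
        x1, x2 = rk4_step(x1, x2, t, h)
        t += h
        right = dE_dt(x1, x2, t)
        integral += 0.5 * h * (left + right)  # trapezoidal rule
    return x1, x2, integral

x1_0, x2_0, T = 1.0, 0.0, 1.0
x1_T, x2_T, integral = follow_curve(x1_0, x2_0, T)

# Change of f = E o p between t = 0 and t = T, computed directly.
f_change = E(x1_T, x2_T, T) - E(x1_0, x2_0, 0.0)

# The transport term sum_i V_i dE/dx_i, which the skew-symmetry of b removes.
g1, g2 = grad_E(x1_T, x2_T, T)
v1, v2 = V(x1_T, x2_T, T)
transport_term = v1 * g1 + v2 * g2
print(f_change, integral, transport_term)
```

In this sketch the change of f along the integral curve agrees with the accumulated integral of the partial derivative in t, and the transport term vanishes, which is the content of (10a) for this particular choice.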
{p}-case: We just know the form of the curve. For this case
we require the following conditions: $E\in C^{1}(R^{n},R)$, $p\in C^{2}(R,R^{n})$,
$f\in C^{1}(R,R)$. We shall follow the same methodology used in the {E}-case. We know
the explicit form of the curve p, hence we know its derivatives explicitly.
We use the symbol $k_i(t)=dx_i/dt$ to denote these explicit functions.
We have the following two equations from the defining relation:

(a) $\frac{df}{dt}=\lim_{\bar{\mathbf{r}}\to p}\frac{\partial E}{\partial t}$ and (b) $\frac{\partial E}{\partial x_i}=\sum_{j=1}^{n-1}b_{ij}k_j(t)$,   (13)

where $b_{ij}$ = const for all i, j. In this case we have supposed that the
components of the vector field, in the limit, are equal to the functions $k_i(t)$. The
solution to these equations is:

$E=\sum_{i,j=1}^{n-1}b_{ij}k_j(t)\,x_i+T(t)$ and $f=\int\lim_{\bar{\mathbf{r}}\to p}\left\{\sum_{i,j=1}^{n-1}b_{ij}\frac{dk_j}{dt}x_i+\frac{dT}{dt}\right\}dt,$

where T is an arbitrary function. In this case we have first solved equation
(13b), and its solution E is used to calculate the partial derivative with respect
to t. Then we have calculated the limit to get the integrand from which f is computed.
Again, with just one entity, the curve, we can define the other two functions
in the functional equation (6).

{f}-case: We just know the form of f. The defining relation is written as:

$\frac{df}{dt}=\lim_{\bar{\mathbf{r}}\to p}\left\{\sum_{i=1}^{n-1}V_i\frac{\partial E}{\partial x_i}+\frac{\partial E}{\partial t}\right\}$   (14)

For this case we propose the following strategy (again we define the curve as
an integral curve of the vector field $V_i$):

(a) $V_i=\frac{dx_i}{dt}=H(t)$, (b) $H(t)\sum_{i=1}^{n-1}\frac{\partial E}{\partial x_i}+\frac{\partial E}{\partial t}=\frac{df}{dt}$.   (15)

Hence the curve has the form $x_i=\int H(t)\,dt$ (i = 1, ..., n-1). The function
E is determined by a first order partial differential equation of a certain special
form (Eq. (15b)).

One may think that the way in which we have solved the problems is artificial because 
we introduced ad hoc vector fields in the reasoning. This is not really the case, it is just 
the effect of our rigid vision of the process of solution. 

Consider, for example, the Poincaré-Cartan 1-form of classical mechanics:

$\omega^{1}=\sum_{i}p_i\,dq_i-H\,dt.$

We do not have the right to write it down as $\omega^{1}=dS$, where S is the action, until we
prove that it is in fact an integrable 1-form. With this purpose in mind we can attack
the problem in the following way: we suppose that the form is integrable and we write
$p_i=\partial S/\partial q_i$, $H(p_i,q_i)=-\partial S/\partial t$, and we get the Hamilton-Jacobi equation. Hence, the
problem of integrability is the problem of the existence of solutions of the Hamilton-Jacobi
equation. As is well known, an analytic solution of this equation always exists locally
(Cauchy-Kovalevsky theorem), hence the 1-form is a locally integrable 1-form. In the
dynamical problem we know the Hamiltonian explicitly, but we know neither the form of
the curve nor the action as a function of the coordinates (not as a functional, because that
is another point of view). But, as is well known, if we can solve the Hamilton-Jacobi
equation we know the action and the solution of the dynamical problem by means of a
canonical transformation generated by this action function. Clearly, in this case we have
introduced all our "auxiliary functions", the action and the Hamiltonian, in order to know the
explicit form of the curve in phase space. Of course, we have required some data: the
form of the Hamiltonian and the supposition of integrability of the 1-form. And from the
theoretical point of view, this is enough to construct the solution of the dynamical problem.
However, we need to make our distinction at this point: the action as a function of the
coordinates differs from the function constructed by restriction of the action to the curve.
Another important point becomes clear when we use 1-forms: in all the cases which
we have treated, we need to suppose the integrability of a 1-form. For example, when we
treat the {E}-case we start from the 1-form

$dE=\sum_{i}\frac{\partial E}{\partial x_i}dx_i+\frac{\partial E}{\partial t}dt,$

which is clearly integrable. Hence, we want to know a curve as an integral curve of a
vector field which we define as

$X=\sum_{i,j}b_{ij}\frac{\partial E}{\partial x_j}\frac{\partial}{\partial x_i}+\frac{\partial}{\partial t}.$

The inner product of these two tensors (the pairing between the tangent and cotangent
spaces) gives us the result

$\langle dE,X\rangle=\frac{\partial E}{\partial t}(x_1,\dots,t),$

hence the composition is, in fact, the result of taking the limit of the inner product along
the integral curves of the vector field X. We can treat the other cases from this point of
view, but that is easy after this explanation. In a geometric interpretation we have the
following elements: the normal and tangent vectors, and the angle between them. In the {E}-case
we have the normal, but we have neither the tangent nor the angle; in the {p}-case we have the
tangent, but we have neither the normal nor the angle; finally, in the {f}-case we have
the angle, but we have neither the tangent nor the normal.

Now let us make a brief review of the last case ({}-case). 

The point is that in this case we have no data at all, and to treat it we need some
information. Heyting [3] notes that we ought to distinguish two different concepts:

1. Theories of the constructible. 

2. Constructive theories 

The first one is characterized by three conditions:

(a) we presuppose a mathematical theory in which the class of constructible
objects can be defined;

(b) the notion of constructibility is not primitive;

(c) we have the liberty to choose the definition of the constructible; but, of course,
it must correspond to our intuitive notion of a mathematical construction.

For the second point (the constructive theories) Heyting says: "a theory in which an
object is only considered as existing after it has been constructed. In other words, in
a constructive theory there can be no mentioning of other than constructible objects".
Heyting's main feeling is expressed in the following sentence: "I am unable to
give an intelligible sense to the assertion that a mathematical object which has not been
constructed exists."

In the case which we want to treat we have no data concerning the equation
$f = E\circ p$. Hence, if we accept that we can only speak about those objects which can be
constructed explicitly (or for which, at least, we have a method of construction), the case which
we are treating, the {}-case, is not even a case. It is nothing; it is just a line of symbols
without any meaning. For this reason, when one speaks about the functional equation
$f = E\circ p$ one is, in fact, speaking about the cases considered before: the {E, p}-, {E, f}-,
{p, f}-, {E}-, {f}- and {p}-cases.

As a last remark we can see that we have shown that the generally accepted expressions 
of the type of Eq.(3) cannot be valid. 

III. ABOUT FUNCTIONAL EXTENSIONS 

We shall give our problem the most general setting. Let us start with a topological
space D, so that it is possible to construct the general object of arrows T(D, K), where
T is any covariant functor. Hence we can construct the functor

$T(D,\ast) : C_1\to C_2,$   (16)

where $C_i$ (i = 1, 2) are any small categories. Then for each arrow $f\in T(D,K)$ we have
the diagram $f : D\to K$. For us, the following situation is the most important: the set
D is an object with a given structure, so we use the symbol P(D) to denote its power
set (which is a topology; of course, any topology is a subset of the power set, but not
every subset of the power set is a topology). In this way, for each element of P(D) we can
define the following elements: $(f_A,A)$ for all $A\in P(D)$. Here the symbol $(f_A,A)$ means
that the object $A\in P(D)$ is put in correspondence with the function $f_A$. So we may form
the set

$F_D=\{(f_A,A)\mid A\in P(D)\}$   (17)

of functional elements.

It is clear that this procedure has been carried out in a somewhat formal manner;
however, this is the most general form. As we can see, there are several elements which are
important for our construction: the covariant functor T, the object D, its power set P(D),
the set of elements $F_D$, which is the part in which the functions enter the discussion, and
the two small categories $C_1$, $C_2$.

Definition 1: We shall call the symbol $(F_D, T, C_1, C_2, P(D))$ a general function.

The idea behind a general function is that all its elements are different for each element 
of the power set of D. Sometimes we can use a specific topology instead of the power set, 
but this choice relies on our convenience. Besides, we can see that in general, any topology 
is just a subset of P{D). The formation of a topology in the object D can respect its 
structure or not. We use the categorical notions to introduce the generality which they 
carry, because in general a function depends on the categories in which it is defined, see 
[4] chap. 1, for more details. Hence, the general setting is: how are the different elements 
of a general function related? The problem may seem trivial without more elaboration, 
however, as we have seen in the introduction, in some realms the problem is not trivial. 
Let us give a few additional examples. 

Example 1: Consider the following example (see [5]), which is clearly not trivial.
Suppose the following choice: D = C, where C is the complex plane. If we use the symbol
Anal to denote the functor of the set of complex analytic functions, we have

$\mathrm{Anal}(C,\ast) : \mathrm{Set}\to\mathrm{Set}$   (18)

or, to be more concrete, the arrows $f : C\to C$ of complex analytic functions are at hand.
We must consider the power set P(C) of the complex plane and the construction of the
elements $(f_A,A)$ for each element of the power set. This is the most general situation for
the choice that we have made of our basic elements. In this setting we have the following
group of well-known definitions [5]:

Given two "functional elements" $f(A)\stackrel{\mathrm{def}}{=}(f_A,A)$, $f(B)\stackrel{\mathrm{def}}{=}(f_B,B)$, we
say that we have a direct analytic prolongation if, and only if, the following two
conditions hold:

$A\cap B\neq\emptyset$

$f_A=f_B$ in $A\cap B$

So we can see that, in general, the problem of analytic continuation is a realization,
in the complex domain, of our definition of a general function.
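The two conditions for a direct analytic prolongation can be checked on a standard textbook case. The following is only a sketch under assumptions not made in the text: the function 1/(1 - z), the two discs A and B, the centre C and the sample points are hypothetical choices, and the power series are truncated, so the agreement is numerical only.

```python
# Hypothetical functional elements built from two power series of 1/(1 - z).

def f_A(z, terms=200):
    """Series of 1/(1 - z) about 0; it converges on A = {|z| < 1}."""
    return sum(z ** k for k in range(terms))

def in_A(z):
    return abs(z) < 1

C = -1.0  # centre of the second expansion (an illustrative choice)

def f_B(z, terms=200):
    """Series of 1/(1 - z) about C; it converges on B = {|z - C| < |1 - C|}."""
    w = (z - C) / (1 - C)
    return sum(w ** k for k in range(terms)) / (1 - C)

def in_B(z):
    return abs(z - C) < abs(1 - C)

# Sample points expected to lie in the intersection of A and B.
samples = [-0.5, 0.3 + 0.2j, -0.2 - 0.4j]
overlap = [z for z in samples if in_A(z) and in_B(z)]

# Second condition: f_A = f_B on the intersection (checked at the samples).
max_gap = max(abs(f_A(z) - f_B(z)) for z in overlap)

# A point of B outside A, where only the element (f_B, B) is available.
outside = -1.5
print(len(overlap), max_gap, in_A(outside), in_B(outside), f_B(outside))
```

Here the intersection is not empty and the two elements agree on it up to rounding, so (f_B, B) prolongs (f_A, A) to points such as `outside`, where the first series no longer converges.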
Example 2: Consider the functors:

$C(D,\ast) : \mathrm{Top}\to\mathrm{CRng}$ and $C^{*}(D,\ast) : \mathrm{Top}\to\mathrm{CRng}$   (19)

from the topological spaces to the rings of continuous functions. The functor $C^{*}$ is for
bounded functions. Here the problem is as follows: a set S is C-embedded if, and only
if, every function $f\in C(S)$ can be extended to a function $g\in C(D)$. Here $S\subset D$ and
C(S) is an abbreviation of C(S, R). The idea here is that the extension is a C-function.
The definition of $C^{*}$-embedding is similar.

One of the most important characterizations of $C^{*}$-embedding is Urysohn's lemma:

A subset S of the set D is $C^{*}$-embedded in D if, and only if, any two completely
separated sets in S are completely separated in D.

We can see that this lemma is just an assertion about functional extensions, that is,
a theorem about the way in which the elements of a general function are related [6], p.
18. Here the set $F_D$ can be constructed once we have fixed the topology of the spaces S
and D, or at least the base of the topology. If we use the power set we have a conceptual
generality, but we can run into trouble for some purposes. Let us take for the topology of D
its power set t(D); hence the set $F_D$ can be formed and we have

$F=(F_D, C, \mathrm{Top}, \mathrm{CRng}, t(D))$   (20)

as our general function for this case. Of course, we can construct F without recourse to
Urysohn's lemma; however, this result gives us a way to relate two elements of the
general function F.

Example 3: Let us come back to the example in the introduction. Consider the
functor:

$C^{1}(R^{n},\ast) : \mathrm{Vect}\to\mathrm{Vect}$   (21)

so the arrows are $f : R^{n}\to R^{n}$, where R is the real line. The power set is now $P(R^{n})$, and the
set of functional elements is $\{(f_A,A)\mid A\in P(R^{n})\}$. The notion of differentiability does
not change, and we can define the derivative of a general function as the general function
formed with the derivatives of the functional elements of the starting general function. If
one of these elements is not differentiable, the general function is not differentiable either.

IV. THE LIMITING PROCEDURE 

Let us explain in more detail the limiting procedure which can be used for the elements
of a general function. Consider the initial object D and suppose a partition of the form:

$D=\bigcup_{i=1}^{n}G_i$   (22)

Hence, the general function is defined with the help of the elements of the set $F_D =
\{(f_i,G_i)\}$ and the functor $T(D,\ast)$. We can make this decomposition in many ways.
For example: $D=\bigcup_{A\in t(D)}A$, where t(D) is the power set of D.
Now we define the system of sets:

$P_i=\{A\in P(D)\mid G_i\subset A\}.$   (23)

In other words: the set of all the sets A in which the set $G_i$ is contained. It is very easy to
show that each $P_i$ is a filter.

Lemma: Each $P_i$ is a model of a filter in D (see [6] p. 24).

Proof: (a) We can see that $\emptyset$ is not an element of $P_i$, for any i, because if $\emptyset\in P_i$ then
we can find a set A (namely $G_i$, taken nonempty) such that $A\subset\emptyset$, which is a contradiction; hence we have proved the
first axiom. (b) If we suppose that $A,B\in P_i$ then $A\cap B\in P_i$ because A and
B have at least the set $G_i$ in common, hence $G_i$ is in their intersection, and this is the condition
for belonging to $P_i$. The second axiom is satisfied. (c) If $A\in P_i$, $B\subset D$ and $A\subset B$, it is
very easy to see that $B\in P_i$. The lemma is proved.
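The three axioms used in the proof can be verified exhaustively on a small finite set. The following is only a sketch: the set D = {0, 1, 2, 3}, the set G = {1, 2} and the function names are hypothetical, and the check is a brute-force enumeration rather than a proof.

```python
from itertools import combinations

def power_set(D):
    """All subsets of the finite set D, as frozensets."""
    items = sorted(D)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def sets_containing(D, G):
    """The family P = {A in P(D) | G is a subset of A}, as in Eq. (23)."""
    return {A for A in power_set(D) if G <= A}

def is_filter(P, D):
    """Check the three axioms (a), (b), (c) of the lemma by enumeration."""
    no_empty_set = frozenset() not in P                              # axiom (a)
    closed_under_meet = all((A & B) in P for A in P for B in P)      # axiom (b)
    upward_closed = all(B in P for A in P
                        for B in power_set(D) if A <= B)             # axiom (c)
    return no_empty_set and closed_under_meet and upward_closed

D = frozenset({0, 1, 2, 3})
G = frozenset({1, 2})

P = sets_containing(D, G)
P_is_filter = is_filter(P, D)

# With an empty G the family is the whole power set, which contains the empty
# set, so axiom (a) fails; this is why G is taken nonempty.
degenerate_is_filter = is_filter(sets_containing(D, frozenset()), D)
print(len(P), P_is_filter, degenerate_is_filter)
```

For this choice the family has four members and passes all three checks, while the degenerate family built from the empty set does not.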

This lemma (trivial as it is) is important, because with a filter we can define a
limit for the elements of a general function. In fact, given the filter $P_i$ of the element i
in the partition, we have a function $f_i$ which maps the element $G_i$. Clearly, the elements
$f_i(G_i)$ are the images of the set D by the general function $F_D$. So, we define the filter of
the image of D by the general function $F_D$ as $F_D(P_i)=\{A\subset K\mid f_i(G_i)\subset A\}$. This is
clearly a filter. With these elements it is possible to set up a well-known definition of the
limiting procedure for the elements of a general function.

Definition 2: A set $G_i$ is the limit of a filter H if, and only if, H is stronger than
the filter $P_i$.

Definition 3: Consider the filter $P_i$; then the set $A\subset K$ is the limit of the
general function $F_D$ under the filter $P_i$ if, and only if, the set A is the limit of the
filter $F_D(P_i)$. That is, without abbreviations: the filter $F_D(P_i)$ is stronger than the
filter formed with the sets that contain A.

We say that a filter H is stronger than the filter B (of course, both filters defined on
the same space) if, and only if, for any $a\in B$ there is a set $b\in H$ such that $b\subset a$. Of
course, this is just the notion of approximation, because a filter H is stronger than a filter
B if its elements are nearer to a certain set than the elements of B. Now let us use
this concept for the example in the introduction. We have the equation:

$\frac{d}{dt}E_{\gamma}=\lim\left\{(\mathbf{V}\cdot\nabla)E_A+\frac{\partial}{\partial t}E_A\right\},$   (24)

where $E_{\gamma}$ is a function along the curve $\gamma$ and $E_A$ is a function defined on the set A. Now,
let us give a precise meaning to the process involved. We have the general function $E_D$,
and two of its functional elements are involved: $(E_{\gamma},\gamma)$ and $(E_A,A)$, where $\gamma$ and A are sets
in D. Hence we can see that the limiting procedure affects only the functional element
$((\mathbf{V}\cdot\nabla)E_A,A)$, so we do the following: we select a set $d\subset\gamma$ and we form its filter $P_d$;
then a set $B\subset R^{n}$ in the image of the functional element $(\mathbf{V}\cdot\nabla)E_A$ is its limit if, and only
if, the filter formed with the sets that contain the image of $(\mathbf{V}\cdot\nabla)E_A$ is stronger than
the filter formed with the sets that contain B. Of course, the extension of this definition
covers the usual $\varepsilon$-$\delta$ arguments.
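The relation "stronger than" and Definition 2 can be tried out on a finite example. The following is only a sketch: the set D = {0, 1, 2, 3}, the sets G_small and G_large and the function names are hypothetical choices, and the filters are those of the form (23).

```python
from itertools import combinations

def sets_containing(D, G):
    """The filter {A subset of D | G subset of A} on the finite set D."""
    items = sorted(D)
    subsets = (frozenset(c) for r in range(len(items) + 1)
               for c in combinations(items, r))
    return {A for A in subsets if G <= A}

def is_stronger(H, B):
    """H is stronger than B: every a in B contains some b in H."""
    return all(any(b <= a for b in H) for a in B)

def is_limit(G, H, D):
    """Definition 2: G is the limit of H iff H is stronger than the filter of G."""
    return is_stronger(H, sets_containing(D, G))

D = frozenset({0, 1, 2, 3})
G_small = frozenset({1})
G_large = frozenset({1, 2})

P_small = sets_containing(D, G_small)
P_large = sets_containing(D, G_large)

small_over_large = is_stronger(P_small, P_large)  # filter of the smaller set
large_over_small = is_stronger(P_large, P_small)  # the converse comparison
large_is_limit = is_limit(G_large, P_small, D)
print(small_over_large, large_over_small, large_is_limit)
```

In this example the filter generated by the smaller set is stronger than the one generated by the larger set, but not conversely, which matches the reading of "stronger" as "approximating more closely".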
V. CONCLUSIONS

As promised in the introduction, we have resolved, in its most general form and with the
help of category theory, the ambiguity which arises in the differential calculus of several variables.
We have also shown several examples of realizations of our construction.